Understanding the Fisher Vector: a multimodal part model

Authors

  • David Novotný
  • Diane Larlus
  • Florent Perronnin
  • Andrea Vedaldi
Abstract

Fisher Vectors (FV) and related orderless visual statistics have demonstrated excellent performance in object detection, sometimes superior to established approaches such as Deformable Part Models (DPM). However, it remains unclear how these models can capture complex appearance variations using visual codebooks of limited size and coarse geometric information. In this work, we propose to interpret Fisher-Vector-based object detectors as part-based models. Through several visualizations and experiments, we show that this is a useful insight for explaining the good performance of the model. Furthermore, we reveal for the first time several interesting properties of the FV, including its ability to work well using only a small subset of input patches and visual words. Finally, we discuss the relation between the FV and DPM detectors, pointing out their differences and commonalities.

Similar resources

Achieving Multimodal Cohesion during Intercultural Conversations

How do English as a lingua franca (ELF) speakers achieve multimodal cohesion on the basis of their specific interests and cultural backgrounds? From a dialogic and collaborative view of communication, this study focuses on how verbal and nonverbal modes cohere together during intercultural conversations. The data include approximately 160-minute transcribed video recordings of ELF interactions ...

Text Sentiment Classification Based on Mixed Cloud Vector Model Clustering and Kernel Fisher Discriminant

In today’s world, the web has dramatically changed the way that people express their opinions. People use the internet to express their opinions, attitudes, feelings and emotions about films, goods, news, etc. It is challenging to automatically classify large volumes of subjective comments into different sentiment orientation categories (e.g. positive/negative). Furthermore, the ambiguity and randomness, whic...

Recognizing Two Handed Gestures with Generative, Discriminative and Ensemble Methods Via Fisher Kernels

Use of gestures extends Human Computer Interaction (HCI) possibilities in multimodal environments. However, the great variability in gestures, both in time, size, and position, as well as interpersonal differences, makes the recognition task difficult. With their power in modeling sequence data and processing variable length sequences, modeling hand gestures using Hidden Markov Models (HMM) is ...

Capacitated Multimodal Structure of a Green Supply Chain Network Considering Multiple Objectives

In this paper, a supply chain network design problem is described that incorporates environmental concerns in the arcs and nodes of the network. It is assumed that several routes (road, rail, etc.) exist between each pair of nodes. In this model, the decision variables are the choice of facilities to open, the environmental investment level in each facility, and the flow of products between nodes on each route. A multi...

Computation of Standard Errors for Maximum-likelihood Estimates in Hidden Markov Models

Explicit computation of the score vector and the observed information matrix in hidden Markov models is described. With the help of the information matrix Wald's confidence intervals can be formed for the model parameters. Finite sample properties of the maximum-likelihood estimator and its standard error are investigated by means of simulation studies. We compare the confidence levels of interva...

Journal:
  • CoRR

Volume: abs/1504.04763, Issue: -

Pages: -

Publication date: 2015